Expression vectors, transformed host cells and fermentation process for the production of
recombinant polypeptides

The present invention relates to an expression vector comprising a polynucleotide encoding
a fusion protein comprising the signal sequence of the gac gene of Pseudomonas diminuta
and a polypeptide of interest, a prokaryotic host cell transformed with such an expression
vector, and a process for production of a polypeptide of interest using said host cell and said
expression vector.

It is an object of the present invention to provide a process for the efficient and direct
production of a mature recombinant polypeptide in a prokaryotic host cell. The process
according to the present invention can, for example, favourably be used for the production of
recombinant human interferon alpha 2B (rhIFNα2B) in Escherichia coli (E. coli).

In the production of recombinant proteins in prokaryotic microorganisms, such as the
expression of human or other eukaryotic proteins in bacterial cells, it is often difficult to obtain
a clearly defined N-terminus which is as nearly 100% homogeneous as possible. This
applies in particular to recombinant pharmaceutical proteins whose amino acid sequence
ought in many cases to be identical to the amino acid sequence naturally occurring in
humans/animals. Any inhomogeneity or deviation from the natural sequence is, however,
unacceptable in many cases because these products frequently show different
immunological (for example induction of antibody formation) and pharmacological (half-life,
pharmacokinetics) properties. For these reasons, it is necessary in most cases to produce a
nature-identical product (homogeneous and without foreign amino acids at the N-terminus).

On natural expression, for example in humans, many pharmaceutical proteins which are in
use are transported into the extracellular space, and cleavage of the signal sequence
present in the precursor protein for this purpose results in a clearly defined N-terminus. Such
a homogeneous N-terminus is not always easy to produce, for example in bacterial cells, for
several reasons.

The synthesis of all cytoplasmic proteins in prokaryotic microorganisms starts with a
methionine due to the start codon ATG, which is both a translation initiation site and the
codon for methionine. Depending on the structure of the second amino acid after the
N-terminal methionine, this methionine may be cleaved off by a host cell methionine
aminopeptidase (MAP), leading to a mixture of products starting either with Met or with the
second amino acid. The separation of these two species is very difficult and leads to
reduced yields. In recombinant cytoplasmic production of a polypeptide it is usual for a not
inconsiderable proportion (1-50%) of the polypeptide to remain unaffected by the MAP.
Therefore, production of recombinant polypeptides using a cytoplasmic Met-expression
system is often highly unfavorable.

Another possibility to produce a mature recombinant polypeptide via a cytoplasmic pathway
is the production of an N-terminal fusion protein with subsequent chemical or enzymatic in
vitro cleavage. However, in many cases the N-terminus of the fusion protein is not easily
accessible to enzymatic cleavage, leading to low cleavage rates or no cleavage at all. This
may be due to the N-terminus being structurally inaccessible.

In addition, recombinant proteins are expressed in the cytoplasm of prokaryotic
microorganisms in a reduced state, with the effect that disulphide bonds often necessary for
correct protein folding and function are not formed or not formed correctly. Recombinant
polypeptides containing disulphide bonds therefore may necessitate difficult in vitro
oxidation.

Also, cytoplasmic expression often leads to the formation of inclusion bodies, which have to
be solubilised under denaturing conditions followed by refolding to the native structure prior to
the actual purification process.

Periplasmic expression, on the other hand, could directly yield a product with the desired 
properties: 

(i) Correct, mature N-terminus by cleaving off of a periplasmic signal sequence by the host
cell's signal peptidase apparatus

(ii) Soluble expression due to correct folding

(iii) Correct disulphide bond formation due to the oxidative milieu in the periplasm, which is very
similar to the one found in the human extracellular fluid (where this molecule is naturally found)

Accordingly, it is an object of the present invention to provide a periplasmic
expression system which is suitable for the production of a recombinant polypeptide of
interest in a prokaryotic host cell.

It has now surprisingly been found within the context of the present invention that an
expression vector encoding the signal sequence of the glutaryl 7-ACA acylase gene (gac
gene) of Pseudomonas diminuta and a polypeptide of interest is particularly suitable for use
in a process for the recombinant production of the polypeptide.

In one aspect, the present invention thus relates to an expression vector comprising a
polynucleotide encoding a fusion protein comprising the signal sequence of the gac gene of
Pseudomonas diminuta and a polypeptide of interest other than gac of Pseudomonas
diminuta, wherein said signal sequence and said polypeptide of interest are linked in such a
way that upon expression of the polynucleotide in a suitable host cell the signal sequence is
cleaved off the fusion protein and the polypeptide of interest is released into the periplasm of
the host cell.

In accordance with the present invention, a variety of polypeptides of interest can be
produced by utilization of the expression vector. For example, the polypeptide of interest can
be selected from the group consisting of an interferon, an interleukin, a growth hormone, a
growth factor, a cytokine, an enzyme, an enzyme inhibitor, an antibody and an antibody
fragment, and the like, for example interferon alpha 2A, interferon alpha 2B, interleukin-3,
interleukin-6, human growth hormone, insulin, granulocyte-colony stimulating factor,
granulocyte macrophage-colony stimulating factor, macrophage-colony stimulating factor,
interferon beta 1, bovine somatropin, porcine somatropin, interleukin-11, interleukin-2, a
Fab-fragment, and small peptides such as calcitonin, parathyroid hormone (PTH), or
glucagon. Preferably, within the scope of the present invention, the polypeptide of interest is
a recombinant human interferon alpha 2, in particular human interferon alpha 2A or human
interferon alpha 2B, the latter being particularly preferred.

With respect to the polynucleotide encoding the part of the fusion protein which is the
polypeptide of interest, a cDNA or a synthetic polynucleotide can be used. If a given cDNA or
synthetic polynucleotide corresponding to a naturally occurring gene sequence is
expressed only poorly, the structure of the cDNA or synthetic polynucleotide may be adapted
to the respective host cell by codon optimization.

In this respect, within a preferred embodiment of the present invention, the polynucleotide
encoding the polypeptide of interest, being rhIFNα2B, comprises the following nucleotide
sequence (SEQ ID NO 1)

5'-TGC GAT CTG CCG CAA ACC CAC AGC CTG GGT AGC CGG CGA ACC TTG
ATG GTT CTG GCA GAG ATG CGG CGA ATC TCT GTT TTC TCT TGC TTA
AAG GAT CGA CAT GAG TTC GGT TTC CCG GAG GAG GAG TTC GGT AAC
GAG TTG CAA AAG GGT GAA ACC ATC CCG GTA TTG CAT GAG ATG ATC
GAG GAG ATG TTC AAC CTG TTC AGC ACT AAG GAC TCT TCT GCT GCT
TGG GAT GAG ACC CTG GTT GAC AAA TTC TAG ACT GAA CTG TAG CAG
GAG CTG AAC GAG CTG GAA GCC TGC GTG ATC GAG GGT GTG GGT GTG
ACT GAG ACT CCG CTG ATG AAG GAG GAC TCT ATT CTG GCT GTG CGA
AAA TAG TTC CAA CGG ATC ACT CTG TAT CTG AAA GAG AAG AAA TAG
AGG CCG TGG GGC TGG GAG GTT GTC CGA GCA GAA ATC ATG CGG TCT
TTG TCT TTG TCT ACC AAC TTG CAA GAA TCT TTA CGA AGC AAG GAA-3',

where 48 out of 165 codons of the naturally occurring human interferon alpha 2B have been
altered with respect to the nucleotide sequence without changing the amino acid sequence.

With respect to the Pseudomonas diminuta strain, any Pseudomonas diminuta strain which
exhibits a glutaryl-7-aminocephalosporanic acid acylase activity will bear a gac gene encoding
a suitable signal sequence.

For example, such a Pseudomonas diminuta strain has been described in Czech Patent No.
CZ 278,515 under designation no. CCM 3987.

Such a Pseudomonas diminuta strain bears the gene encoding the enzyme
glutaryl-7-ACA acylase, abbreviated gac.

A preferred embodiment of the present invention relates to the expression vector according
to the present invention, wherein said signal sequence of the gac gene of Pseudomonas
diminuta, forming part of the said fusion protein, comprises the amino acid sequence (SEQ
ID NO 2)

MLRVLHRAASALVMATVIGLAPAVAFA.

For example, such amino acid sequence may be encoded by the following polynucleotide
sequence (SEQ ID NO 3)

5'-ATG CTG AGA GTT CTG CAC CGG GCG GCG TCC GCC TTG GTT ATG GCG
ACT GTG ATC GGC CTT GCG CCC GCC GTC GCC TTT GCG-3'

Likewise, a polynucleotide sequence in which nucleotides have been altered so as to create
a restriction enzyme site can be used as long as such mutations are silent, i.e. they do not
change the amino acid sequence of the gac signal sequence as outlined above.

Accordingly, another example for a suitable polynucleotide sequence encoding the gac 
signal sequence may be (SEQ ID NO 4) 

5'-ATG CTG AGA GTT CTG CAC CGG GCG GCG TCC GCC TTG GTT ATG GCG
ACT GTG ATC GGC CTT GCG CCC GCG GTC GCC TTT GCG-3',

wherein, compared to the polynucleotide sequence mentioned above, a single mutation
(C->G) has been introduced with respect to the last nucleotide of codon 23 in order to
obtain a restriction enzyme site (Sac II site) which may be used in further cloning.
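
By way of illustration only, the relationship between SEQ ID NO 2, SEQ ID NO 3 and SEQ ID NO 4 may be checked with a short Python sketch. The function names translate and codon_differences are hypothetical and form no part of the invention; the standard genetic code and the Sac II recognition sequence CCGCGG are assumed. The sketch translates both polynucleotide sequences, lists the codons in which they differ, and looks for the Sac II recognition sequence.

```python
# Illustrative sketch only; function names are hypothetical.
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

SAC_II_SITE = "CCGCGG"  # assumed recognition sequence of Sac II

SEQ_ID_NO_2 = "MLRVLHRAASALVMATVIGLAPAVAFA"
SEQ_ID_NO_3 = (
    "ATG CTG AGA GTT CTG CAC CGG GCG GCG TCC GCC TTG GTT ATG GCG "
    "ACT GTG ATC GGC CTT GCG CCC GCC GTC GCC TTT GCG"
)
SEQ_ID_NO_4 = (
    "ATG CTG AGA GTT CTG CAC CGG GCG GCG TCC GCC TTG GTT ATG GCG "
    "ACT GTG ATC GGC CTT GCG CCC GCG GTC GCC TTT GCG"
)


def clean(dna: str) -> str:
    """Remove whitespace and upper-case a nucleotide sequence."""
    return "".join(dna.split()).upper()


def codons(dna: str) -> list:
    """Split a cleaned coding sequence into codons."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def translate(dna: str) -> str:
    """Translate a coding sequence with the standard genetic code."""
    return "".join(CODON_TABLE[codon] for codon in codons(dna))


def codon_differences(dna_a: str, dna_b: str) -> list:
    """Return (1-based codon number, codon in a, codon in b) for each mismatch."""
    return [
        (number, a, b)
        for number, (a, b) in enumerate(zip(codons(dna_a), codons(dna_b)), start=1)
        if a != b
    ]


if __name__ == "__main__":
    print(translate(SEQ_ID_NO_3) == SEQ_ID_NO_2)
    print(translate(SEQ_ID_NO_4) == SEQ_ID_NO_2)
    print(codon_differences(SEQ_ID_NO_3, SEQ_ID_NO_4))
    print(SAC_II_SITE in clean(SEQ_ID_NO_4))
```

A mutation is silent in the sense used above when translate returns the same amino acid sequence for both polynucleotides.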

Another preferred embodiment of the present invention relates to the expression vector
according to the present invention, wherein said vector is a plasmid. A low (about 1-10
copies per cell), medium (about 10-50 copies per cell) or high (more than about 50 copies per
cell) copy plasmid may be used. These definitions apply if such a plasmid is used in a
favourable environment, and such copy numbers may be lower under certain fermentation
conditions, e.g. under reduced temperature conditions. For strong inducible systems a
medium copy basic replicon (e.g. pBR322) may be adequate; for strong basal expression
from low or medium strength expression elements (promoter, RBS, structural gene) a high
copy replicon (e.g. plasmids from the pUC series) may be better suited. In the context of the
present invention, in particular with respect to production of rhIFNα2B, a high-copy plasmid
is preferably used. Accordingly, a further preferred embodiment of the present invention
relates to an expression vector according to the present invention, wherein said vector is a
high copy plasmid.

Further elements of the expression vector according to the present invention comprise 
transcription and translation elements, in particular a promoter and a ribosomal binding site 
(RBS). 

For periplasmic expression, the ribosomal binding site should neither be too strong nor too
weak. The former could lead to an overstrain of the periplasmic protein export apparatus
(translocase etc.) and the deposition of (often insoluble) uncleaved fusion protein in the
cytoplasm. The latter could lead to poor protein yields. The situation on the transcriptional
level (promoter) is similar to the one on the translational level (RBS). Transcription should
neither be too strong nor too weak in order to avoid the described phenomena. Also, the
sudden onset of protein production when using an inducible promoter may cause problems
with "clogging" the export apparatus or will at least require extensive fine-tuning of the
induction step and fermentation parameters. This will often lead to non-robust processes.

It has been found that in the context of the present invention the promoter region and the
ribosomal binding site of the gac gene of Pseudomonas diminuta are particularly suitable for
serving as transcriptional and translational regulatory elements.

Therefore, a preferred embodiment comprises an expression vector according to the present
invention, wherein said vector further comprises a polynucleotide comprising the promoter
region and the ribosomal binding site of the gac gene of Pseudomonas diminuta, which
polynucleotide is operatively linked to the polynucleotide encoding the fusion protein
comprising the signal sequence and the polypeptide of interest.

In a further preferred embodiment thereof said polynucleotide comprising the promoter
region and the ribosomal binding site comprises the nucleotide sequence (SEQ ID NO 5)

5'-ATCCTGGTTCGTACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCG
CTGCAACGGCGTCTCCGGATCTGGGTGAGAGGGGAAATCC-3'

In a yet further preferred embodiment thereof said polynucleotide comprising this promoter
region and the ribosomal binding site comprises the nucleotide sequence (SEQ ID NO 6)

5'-TCTAGACCAACAACATCTTCAACGTCTACCCGACCAAGATTCAGGAGCCGTCGGCCGAC
CTGGGCAATGGGATGTACAGCGGGCTTGCGCCGTTCGGCTTCACCGGCGGATCCTGGTTCGT
ACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCGCTGCAACGGCGTCT
CCGGATCTGGGTGAGAGGGGAAATCC-3'.

Further elements may be present in an expression vector according to the present invention, 
as appropriate. 

For example, an expression vector according to the present invention may comprise a 
polynucleotide comprising one or more transcription terminator(s). 

Likewise, an expression vector according to the present invention may comprise a
polynucleotide encoding one or more selectable markers, e.g. to provide for antibiotic
resistance of a transformed host cell. Suitable selectable markers are widely known in the
art. In the context of the present invention, the expression vector favourably comprises a
polynucleotide comprising a tetracycline resistance gene.

Additionally, where suitable, further regulatory elements may be present on an expression
vector according to the present invention. Regulatory elements, such as a repressor or an
enhancer, are widely known in the art.

In a further aspect, the present invention relates to prokaryotic host cells which are
transformed with an expression vector according to the present invention in order to be
capable of bringing about expression of the polypeptide of interest.

Therefore, the present invention relates to a prokaryotic host cell transformed with an
expression vector which is compatible with the host cell, said vector comprising a
polynucleotide encoding a fusion protein comprising the signal sequence of the gac gene of
Pseudomonas diminuta and a polypeptide of interest other than gac of Pseudomonas
diminuta, wherein said signal sequence and said polypeptide of interest are linked in such a
way that upon expression of the polynucleotide in a suitable host cell the signal sequence is
cleaved off the fusion protein and the polypeptide of interest is released into the periplasm of
the host cell.

Examples for suitable polypeptides of interest are those as mentioned above. A preferred
embodiment thereof relates to such a prokaryotic host cell, wherein the polypeptide of
interest is an interferon alpha 2. In particular, such interferon alpha 2 is selected from the
group consisting of interferon alpha 2A and interferon alpha 2B, the latter being preferred.

Another preferred embodiment thereof relates to a host cell according to the present
invention, wherein said vector is a plasmid, preferably a high copy plasmid.

The present invention further relates to a host cell according to the present invention,
wherein said signal sequence of the gac gene of Pseudomonas diminuta comprises the
amino acid sequence (SEQ ID NO 2)

MLRVLHRAASALVMATVIGLAPAVAFA. 

In a further aspect, the present invention relates to the host cell according to the present
invention, wherein said vector further comprises a polynucleotide comprising the promoter
region and the ribosomal binding site of the gac gene of Pseudomonas diminuta, which
polynucleotide is operatively linked to the polynucleotide encoding the fusion protein
comprising the signal sequence and the polypeptide of interest.

Such polynucleotide comprising the promoter region and the ribosomal binding site 
preferably comprises the nucleotide sequence (SEQ ID NO 5) 

5'-ATCCTGGTTCGTACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCG
CTGCAACGGCGTCTCCGGATCTGGGTGAGAGGGGAAATCC-3'.

In a further preferred embodiment thereof, such polynucleotide comprising the promoter
region and the ribosomal binding site comprises the nucleotide sequence (SEQ ID NO 6)

5'-TCTAGACCAACAACATCTTCAACGTCTACCCGACCAAGATTCAGGAGCCGTCGGCCGAC
CTGGGCAATGGGATGTACAGCGGGCTTGCGCCGTTCGGCTTCACCGGCGGATCCTGGTTCGT
ACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCGCTGCAACGGCGTCT
CCGGATCTGGGTGAGAGGGGAAATCC-3'.

With respect to prokaryotic host cells, preferably said host cell according to the present
invention is a Gram-negative bacterial cell. Preferably such a bacterial cell is selected from
the group consisting of Escherichia coli (E. coli), Pseudomonas sp., Enterobacter sp.,
Campylobacter sp., and Vitreoscilla sp., E. coli being particularly preferred. Preferably,
derivatives of E. coli K-12 will be used because such strains have a long history of safe use
and are particularly suitable for periplasmic expression. Likewise, other types of E. coli,
e.g. E. coli B derivatives, may be used.

In a preferred embodiment of the present invention, the prokaryotic host cell is derived from
E. coli W3110 (ATCC 27325). Such a strain has been deposited with the American Type
Culture Collection (ATCC), 10801 University Boulevard, Manassas, VA 20110-2209, USA on
February 28, 2001, under the Designation No. PTA-3132.

E. coli W3110 (ATCC 27325) and the deposited strain ATCC PTA-3132 can genetically be
characterized as follows:

Escherichia coli K-12 [F- mcrA mcrB IN(rrnD-rrnE)1 lambda-].

A further aspect of the present invention relates to a process for production of a polypeptide 
of interest, comprising 

(i) providing a prokaryotic host cell transformed with an expression vector which is
compatible with the host cell, said vector comprising a polynucleotide encoding a fusion
protein comprising the signal sequence of the gac gene of Pseudomonas diminuta and a
polypeptide of interest other than gac of Pseudomonas diminuta, wherein said signal
sequence and said polypeptide of interest are linked in such a way that upon expression of
the polynucleotide in a suitable host cell the signal sequence is cleaved off the fusion protein
and the polypeptide of interest is released into the periplasm of the host cell, and

(ii) culturing the prokaryotic host cell under conditions which cause expression of the
polynucleotide whereby upon formation of the fusion protein the signal sequence is cleaved
off the fusion protein and the polypeptide of interest is released into the periplasm of the host
cell.

Optionally, the process according to the present invention further comprises isolation of the 
polypeptide of interest. 

Examples for suitable polypeptides of interest are those as mentioned above. A preferred
embodiment thereof relates to such a process, wherein the polypeptide of
interest is an interferon alpha 2. In particular, such interferon alpha 2 is selected from the
group consisting of interferon alpha 2A and interferon alpha 2B, the latter being preferred.

Another preferred embodiment thereof relates to the process according to the present
invention, wherein said vector is a plasmid, preferably a high copy plasmid.

The present invention further relates to a process according to the present invention,
wherein said signal sequence of the gac gene of Pseudomonas diminuta comprises the 
amino acid sequence (SEQ ID NO 2) 

MLRVLHRAASALVMATVIGLAPAVAFA. 

In a further aspect, the present invention relates to the process according to the present
invention, wherein said vector further comprises a polynucleotide comprising the promoter
region and the ribosomal binding site of the gac gene of Pseudomonas diminuta, which
polynucleotide is operatively linked to the polynucleotide encoding the fusion protein
comprising the signal sequence and the polypeptide of interest.

Such polynucleotide comprising the promoter region and the ribosomal binding site 
preferably comprises the nucleotide sequence (SEQ ID NO 5) 

5'-ATCCTGGTTCGTACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCG
CTGCAACGGCGTCTCCGGATCTGGGTGAGAGGGGAAATCC-3'.

In a further preferred embodiment thereof, such polynucleotide comprising the promoter
region and the ribosomal binding site comprises the nucleotide sequence (SEQ ID NO 6)

5'-TCTAGACCAACAACATCTTCAACGTCTACCCGACCAAGATTCAGGAGCCGTCGGCCGAC
CTGGGCAATGGGATGTACAGCGGGCTTGCGCCGTTCGGCTTCACCGGCGGATCCTGGTTCGT
ACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCGCTGCAACGGCGTCT
CCGGATCTGGGTGAGAGGGGAAATCC-3'.

With respect to prokaryotic host cells, preferably said host cell according to the present
invention is an E. coli cell; implications of that have been described in detail above.

In a further embodiment the present invention relates to the cultivation (or fermentation) part
of the process utilizing the transformed host cell according to the present invention.

Many factors can influence the productivity of fermentation processes employing
recombinant organisms. The applied fermentation strategy has to consider the sensitive
relationships between microbial physiology and plasmid copy number, plasmid stability and
gene expression, which is determined not only at the gene level but also by media composition
and process conditions. Besides the stability of the host strain, stability of the product is of
great importance for high level production of recombinant proteins.

For the development of a fermentation process employing a constitutive expression system,
growth conditions and recombinant product formation are closely related to each other.
Therefore the effort required for optimal growth and for control of growth conditions, which are
closely related to product formation during the whole fermentation run, is higher compared to
induced expression systems.

Typical E. coli fermentation processes for the production of recombinant proteins are
characterized by short fermentation times in a range between a few hours and about 100
hours of cultivation. Most often not only a carbon source but also various complex or
inorganic nitrogen sources, different salts and trace elements are fed to E. coli cultures.
Carbon feeding usually follows time-profiles with a stepwise increase or exponential increase
of the feeding rates. Sometimes also mixtures of carbon sources are used during E. coli
fermentations. Feeding of the carbon source is usually coupled to the oxygen transfer
capacity of the bioreactor by the use of various control strategies.

A further object of the present invention is to provide a robust and reproducible fermentation
process, based on the gac expression system and corresponding transformed host cells
according to the present invention, for high yield periplasmic expression of a polypeptide of
interest without the use of extensive nitrogen source feeding and optionally without the
addition of trace elements to the fermentation media.

The process according to the invention is carried out in principle by initially cultivating the
bacterial host cell, i.e. the expression strain, in accordance with microbiological practice
known per se. The strain is generally brought up starting from a single colony on a nutrient
medium, but it is also possible to employ cryopreserved cell suspensions (cell banks). The
strain is generally cultivated in a multistage process in order to obtain sufficient biomass for
further use.

On a small scale, this can take place in shaken flasks, wherein it is possible in most cases to
employ a complex medium (for example LB broth). However, it is also possible to use
defined media (for example citrate medium). For the cultivation, a small-volume preculture of
the host strain (inoculated with a single colony or with a cell suspension from a cryoculture)
is grown, the temperature for this cultivation not generally being critical for the later
expression result, so that it is possible routinely to operate at relatively high temperatures
(for example 30°C or 37°C). The main culture is set up in a larger volume (for example
500 ml), where it is in particular necessary to ensure good aeration (large volume of flask
compared with the volume of contents, high speed of rotation). Since it is intended that
expression takes place in soluble form, the main culture will in most cases also be carried
out at a somewhat lower temperature (for example 27°C or 28°C). Both inducible systems
(for example with trp, lac, tac or phoA promoter) and constitutive systems (like the preferred
system of the present invention comprising the gac promoter) are suitable for producing
soluble proteins. The resulting cells can be harvested and processed further.

On a larger scale, the multistage system consists of a plurality of bioreactors (fermenters),
preferably employing defined nutrient media in order to be able to improve the process
engineering control of the process or employing complex nutrient media in order to enhance
growth of the microorganism and to increase robustness of the process. In addition, it is
possible to greatly increase biomass and product formation by feeding of particular nutrients
(fed batch mode). For example, a preliminary stage fermenter and a main stage fermenter
are used. The preliminary stage fermenter is inoculated with a so-called inoculum which is
generally grown from a single colony or a cryoculture in a shake flask. Good aeration must
also be ensured in the fermenter, and especially in the main stage thereof. The resulting
cells are once again delivered for further processing.

Accordingly, in a preferred embodiment the culturing (or cultivation) of the process according
to the present invention as described herein is performed as a multi-stage process
comprising a pre-culture step and a main culture step. In the alternative, a single-step
fermentation without a pre-culture step is possible. In an even more preferred embodiment,
the process according to the present invention as described herein is performed as a
multi-stage fermentation process comprising a shake-flask step, optionally a pre-culture
step, and a main-culture step.

In particular, said culturing of the prokaryotic host cell in the main culture step is performed
in a culture medium comprising a substrate for more than about 90% of the cultivation time
at a substrate concentration lower than the saturation constant of the substrate,
accompanied by high levels of dissolved oxygen concentration, and further accompanied by
a steadily decreasing specific growth rate of the bacterial host cells, the process being
performed at a temperature which is lower than the optimum temperature for growth of the
host cell.

In this context, the saturation constant is the concentration of a substrate (in particular the 
carbon source) at which the host cell is growing at a specific growth rate which is equivalent 
to 50% of the maximum specific growth rate. 

In this context, the specific growth rate is the increase of biomass concentration in a certain
time interval divided by the average biomass concentration of said time interval.
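
The two definitions above may be illustrated by a minimal Python sketch. It rests on two assumptions which are not stated in the text: first, that growth follows Monod-type kinetics, which is one common model under which the saturation constant is exactly the substrate concentration giving 50% of the maximum specific growth rate; second, that the increase of biomass is taken per unit of time, so that the specific growth rate is obtained in units of 1/h. The function names and all numerical values are hypothetical.

```python
# Illustrative sketch only; names and numbers are hypothetical.

def specific_growth_rate(x_start: float, x_end: float, dt_hours: float) -> float:
    """Biomass increase per hour divided by the mean biomass of the interval.

    x_start and x_end are biomass concentrations (e.g. g/l) at the beginning
    and end of the interval; the result has the unit 1/h.
    """
    if dt_hours <= 0:
        raise ValueError("time interval must be positive")
    mean_biomass = (x_start + x_end) / 2.0
    if mean_biomass <= 0:
        raise ValueError("biomass concentration must be positive")
    return (x_end - x_start) / dt_hours / mean_biomass


def monod_growth_rate(mu_max: float, k_s: float, substrate: float) -> float:
    """Specific growth rate under assumed Monod kinetics.

    k_s is the saturation constant: at substrate == k_s the result is
    mu_max / 2, in line with the definition given in the text.
    """
    if substrate < 0 or k_s <= 0:
        raise ValueError("substrate must be >= 0 and k_s must be > 0")
    return mu_max * substrate / (k_s + substrate)


if __name__ == "__main__":
    # Hypothetical values: biomass rises from 10 to 12 g/l within 2 h.
    print(specific_growth_rate(10.0, 12.0, 2.0))
    # Hypothetical values: mu_max = 0.6 1/h, K_s = 0.05 g/l, substrate at K_s.
    print(monod_growth_rate(0.6, 0.05, 0.05))
```

Under this assumed model, a substrate concentration kept below the saturation constant corresponds to a specific growth rate below half of the maximum specific growth rate.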

Preferably, the culture medium of the main culture as well as, where applicable, of the
pre-culture, is a complex medium comprising a complex nitrogen source, preferably a yeast
extract, various salts and a carbon source to support initial growth of the host cell. In a
preferred embodiment thereof, said carbon source is either added to the main culture
medium by feeding of said carbon source after inoculation or, preferably, present in the main
culture at the time of inoculation.

Preferably, the concentration of dissolved oxygen in the main culture step is higher than
about 20%, more preferably from about 40% up to about 100% of saturation.

Preferably, the steadily decreasing growth rate in the main culture step is from about 2 h⁻¹ to
about 0.001 h⁻¹.

In a preferred embodiment, the temperature in the main culture step is between about 22°C
and about 35°C, preferably between about 25°C and about 31°C, most preferably about
28°C.

In another preferred embodiment, said cultivation in the pre-culture and/or main culture is
performed at a pH value in the range of about 6.7 to about 7.3.
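
Purely as an illustration, the preferred ranges given above for dissolved oxygen, temperature and pH may be collected in a small Python sketch that checks a set of main culture setpoints. The class and function names are hypothetical, the word "about" in the text is ignored so that the limits are treated as exact and inclusive, and the lower temperature limit is read as 22°C.

```python
# Illustrative sketch only; class and function names are hypothetical.
from dataclasses import dataclass


@dataclass(frozen=True)
class MainCultureSetpoints:
    temperature_c: float          # degrees Celsius
    ph: float
    dissolved_oxygen_pct: float   # percent of saturation


# Ranges taken from the text; "about" is ignored and limits are inclusive.
PREFERRED_RANGES = {
    "temperature_c": (22.0, 35.0),
    "ph": (6.7, 7.3),
    "dissolved_oxygen_pct": (20.0, 100.0),
}


def out_of_range(setpoints: MainCultureSetpoints) -> list:
    """Return one message per setpoint lying outside its preferred range."""
    problems = []
    for name, (low, high) in PREFERRED_RANGES.items():
        value = getattr(setpoints, name)
        if not low <= value <= high:
            problems.append(f"{name}={value} outside [{low}, {high}]")
    return problems


if __name__ == "__main__":
    # 28 degrees C is the most preferred temperature named in the text;
    # the pH and dissolved oxygen values are hypothetical examples.
    print(out_of_range(MainCultureSetpoints(28.0, 7.0, 40.0)))
    print(out_of_range(MainCultureSetpoints(37.0, 7.0, 40.0)))
```

Such a check covers only the static ranges; the steadily decreasing specific growth rate and the substrate limitation described above are dynamic criteria and are not modelled here.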

In a further preferred embodiment of the present invention, the substrate is glycerol or,
preferably, a carbohydrate. Preferably, the carbohydrate is glucose.

As mentioned herein, examples for suitable proteins of interest are those as mentioned
above. In a preferred embodiment, the polypeptide of interest is an interferon alpha 2. In
particular, such interferon alpha 2 is selected from the group consisting of interferon alpha
2A and interferon alpha 2B, the latter being preferred.

In a most preferred embodiment regarding all aspects of the processes of the present
invention, the host cell is an E. coli cell.

The polypeptide of interest can then be isolated by protein purification methods known to the
skilled person (see, for example, M.P. Deutscher, in: Methods in Enzymology: Guide to
Protein Purification, Academic Press Inc., (1990), 309-392). A purification sequence
generally comprises a cell disruption step, a clarification step (centrifugation or
microfiltration) and various chromatographic steps, filtrations and/or precipitations. A suitable
example for the isolation of a polypeptide of interest produced in accordance with the
present invention is given below.

Deposition of Microorganisms:

E. coli strain W3110 (ATCC 27325) has been deposited with the American Type Culture
Collection (ATCC), 10801 University Boulevard, Manassas, VA 20110-2209, USA on
February 28, 2001, under the Designation No. PTA-3132.

The following examples serve to illustrate the present invention, without in any way limiting
the scope thereof. Subject-matter disclosed in the examples relates to preferred
embodiments of the present invention.

Examples

Example 1: Construction of a host cell strain for production of recombinant human interferon
alpha 2B (rhIFNα2B)

1.1 General considerations 

The polypeptide rhIFNα2b (recombinant human interferon-α2b) is produced in the
Escherichia coli K-12 strain W3110 transformed with a plasmid containing an optimized
synthetic gene coding for rhIFNα2b. rhIFNα2b is produced under the control of the promoter
and Ribosome Binding Site (RBS) of the glutaryl 7-ACA acylase gene (gac) from
Pseudomonas diminuta CCM 3987 by fermentation of recombinant E. coli K-12. rhIFNα2b is
expressed as an N-terminal fusion protein with the signal sequence from the same (gac)
gene, directing the protein to the periplasm with concurrent processing (cleaving off) of the
signal sequence. The fermentation process therefore directly yields mature rhIFNα2b with a
primary sequence identical to that of naturally occurring human interferon alpha 2b. The
expression plasmid is designated pMG414, the production strain W3110[pMG414].

1.2 Construction of expression vector pMG414 

pUC19 serves as the starting point for the construction of the vector plasmid. pUC19 is a
frequently used and thoroughly characterized high copy plasmid. It contains a highly efficient
origin of replication and an ampicillin resistance (amp or bla) gene (Yanisch-Perron et al.,
1985; Vieira and Messing, 1982; GenBank accession numbers L09137 and X02514).
Even though pUC19 is frequently used for the construction of expression plasmids, the amp
gene may not be an ideal selectable marker for industrial purposes. For this reason the
promoter and the coding region of the amp gene are removed and replaced by the promoter
and the coding region of the tetracycline resistance gene (tet) from the well known safety
plasmid pBR322 (Bolivar et al., 1977a, 1977b, 1978; review: Balbás et al., 1986; GenBank
accession numbers J01749, K00005, L08654, M10283, M10286, M10356, M10784,
M10785, M10786, M33694, V01119). This cloning work is performed with the help of high
fidelity PCR techniques.

To achieve this, the fragment spanning bps 1743 to 679 of pUC19 is amplified using high
fidelity PCR (Pwo DNA Polymerase system from Roche Biochemicals) and the following
5'-phosphorylated oligonucleotides:

Oligo 235: 5'- Phosphate - TAACTGTCAG ACCAAGTTTA CTC -3' (SEQ ID NO 7) 
Oligo 236: 5'- Phosphate - GCGTTTCGGT GATGACGGTG -3' (SEQ ID NO 8) 

The resulting PCR fragment is 1624 bps in length and contains the complete pUC19
backbone lacking the amp promoter and coding sequence, but including the stop codon and
transcription terminator from the amp gene.

As mentioned above, the tet promoter and coding sequence (excluding the stop codon) are
amplified from pBR322. Again, high fidelity PCR is used to amplify bps 4 to 1273 of
pBR322. The following 5'-phosphorylated oligonucleotides are used for this amplification:

Oligo 237: 5'- Phosphate - TCATGTTTGA CAGCTTATCA TCG -3' (SEQ ID NO 9) 
Oligo 238: 5'- Phosphate - GGTCGAGGTG GCCCGGCTC -3' (SEQ ID NO 10)

The resulting PCR fragment is 1270 bps in length. The two PCR fragments are purified by
preparative agarose gel electrophoresis and ligated using T4 DNA Ligase (Rapid DNA
Ligation Kit, Roche Biochemicals). The ligated DNA is purified and electroporated into
Escherichia coli K-12 DH10B (Life Technologies ElectroMAX DH10B electrocompetent cells,
genotype: F- mcrA Δ(mrr-hsdRMS-mcrBC) φ80dlacZΔM15 ΔlacX74 deoR recA1 endA1
araD139 Δ(ara, leu)7697 galU galK λ- rpsL nupG). Transformed cells are plated onto LB
agar containing 15 mg/L tetracycline and 3 g/L glucose. Liquid cultures are grown in LB broth
containing 15 mg/L tetracycline and 3 g/L glucose and plasmid DNA is isolated from these
cultures using standard miniprep methods. Plasmid DNAs are analyzed by restriction
analysis for correct integration of the tet fragment into the pUC19 backbone. Since
integration of the fragment is unspecific with respect to orientation, only about 50% of all
insert-containing clones have the fragment inserted in the correct orientation, i.e. the tet gene
running in the same direction as the amp gene in pUC19. A larger amount of DNA is
isolated from liquid cultures of a few clones and subjected to more detailed restriction
analyses. Of those clones showing correct restriction patterns, one is selected for further
cloning work.

The respective plasmid is designated pMG402. It is identical to pUC19 in all features and
functions but for the fact that it must be grown on/in tetracycline-containing media instead of
ampicillin-containing media. This way a tet resistant high copy vector suitable for industrial
purposes is generated.

Features of plasmid pMG402: 

bps 1954-680: pUC19 backbone (pUC19 lacking the amp promoter and structural gene)
bps 681-1953: tet promoter and structural gene from pBR322

rhIFNα2b is expressed as an N-terminal fusion with the signal sequence of glutaryl 7-ACA
acylase from Pseudomonas diminuta CCM 3987 (gac1ss = SEQ ID NO 2) directing the
protein to the periplasm with concurrent processing (cleaving off) of the signal sequence by
the host cell's signal peptidase apparatus.

Amino acid sequence of gac1ss (27 aa): MLRVLHRAAS ALVMATVIGL APAVAFA

In the 3' region of the coding sequence of the gac1ss a Sac II restriction endonuclease site
is introduced via the 3' PCR primer creating a silent mutation (amino acid sequence
unchanged). This Sac II site allows fusion of the gac1ss coding region with the rhIFNα2b
gene.

The structural gene for rhIFNα2b is synthesized chemically. It differs from the natural human
cDNA sequence in 48 of 165 codons and is designed to eliminate any weak and error prone
codons.
In the following table, codon changes are indicated. In the table, "Natural codon" refers to
the cDNA sequence published by Streuli et al., 1980 (GenBank Accession Number V00548).
The amino acid numbers refer to mature hIFNα2b (starting with Cys 1). The amino acid
sequence to be encoded by the synthetic gene is taken from the SwissProt Database,
Accession number P01563/P01564 (amino acids 24 to 188).



Exchange  Amino    Natural  Synthetic   Exchange  Amino    Natural  Synthetic
no.       acid     codon    codon       no.       acid     codon    codon

1         Cys 1    TGT      TGC         25        Leu 80   CTC      CTG
2         Pro 4    CCT      CCG         26        Leu 81   CTA      CTT
3         Arg 12   AGG      CGG         27        Leu 88   CTC      CTG
4         Arg 13   AGG      CGA         28        Asn 93   AAT      AAC
5         Leu 17   CTC      CTT         29        Cys 98   TGT      TGC
6         Arg 22   AGG      CGG         30        Ile 100  ATA      ATC
7         Arg 23   AGA      CGA         31        Gly 102  GGG      GGT
8         Ser 28   TCC      TCT         32        Gly 104  GGG      GGT
9         Leu 30   TTG      TTA         33        Thr 106  ACA      ACT
10        Asp 32   GAC      GAT         34        Pro 109  CCC      CCG
11        Arg 33   AGA      CGA         35        Ser 115  TCC      TCT
12        Phe 36   TTT      TTC         36        Arg 120  AGG      CGA
13        Gly 37   GGA      GGT         37        Arg 125  AGA      CGG
14        Phe 38   TTT      TTC         38        Leu 128  CTC      CTG
15        Pro 39   CCC      CCG         39        Pro 137  CCT      CCG
16        Phe 43   TTT      TTC         40        Cys 138  TGT      TGC
17        Gly 44   GGC      GGT         41        Arg 144  AGA      CGA
18        Pro 54   CCT      CCG         42        Arg 149  AGA      CGG
19        Val 55   GTC      GTA         43        Phe 151  TTT      TTC
20        Leu 56   CTC      TTG         44        Ser 154  TCA      TCT
21        Asn 65   AAT      AAC         45        Thr 155  ACA      ACC
22        Leu 66   CTC      CTG         46        Ser 160  AGT      TCT
23        Thr 69   ACA      ACT         47        Arg 162  AGA      CGA
24        Ser 72   TCA      TCT         48        Ser 163  AGT      AGC

The resulting gene allows efficient and precise transcription and translation of rhIFNα2b in
Escherichia coli. Since the gene is designed for expression in a bacterial system it does not
contain any untranslated sequences (introns etc.).
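
The codon exchanges listed in the table can be checked mechanically. The following is a minimal sketch in Python, not part of the original disclosure: it builds the standard genetic code and confirms that each of the 48 exchanges replaces a codon by a synonymous one, so that the encoded amino acid sequence stays unchanged. The codon pairs are transcribed from the table; all identifiers (GENETIC_CODE, parse_exchanges, non_synonymous) are illustrative names chosen for this sketch.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
GENETIC_CODE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

THREE_TO_ONE = {"Cys": "C", "Pro": "P", "Arg": "R", "Leu": "L", "Ser": "S",
                "Asp": "D", "Phe": "F", "Gly": "G", "Val": "V", "Asn": "N",
                "Thr": "T", "Ile": "I"}

# Amino acid, position in mature protein, natural codon, synthetic codon.
EXCHANGES = """
Cys 1 TGT TGC
Pro 4 CCT CCG
Arg 12 AGG CGG
Arg 13 AGG CGA
Leu 17 CTC CTT
Arg 22 AGG CGG
Arg 23 AGA CGA
Ser 28 TCC TCT
Leu 30 TTG TTA
Asp 32 GAC GAT
Arg 33 AGA CGA
Phe 36 TTT TTC
Gly 37 GGA GGT
Phe 38 TTT TTC
Pro 39 CCC CCG
Phe 43 TTT TTC
Gly 44 GGC GGT
Pro 54 CCT CCG
Val 55 GTC GTA
Leu 56 CTC TTG
Asn 65 AAT AAC
Leu 66 CTC CTG
Thr 69 ACA ACT
Ser 72 TCA TCT
Leu 80 CTC CTG
Leu 81 CTA CTT
Leu 88 CTC CTG
Asn 93 AAT AAC
Cys 98 TGT TGC
Ile 100 ATA ATC
Gly 102 GGG GGT
Gly 104 GGG GGT
Thr 106 ACA ACT
Pro 109 CCC CCG
Ser 115 TCC TCT
Arg 120 AGG CGA
Arg 125 AGA CGG
Leu 128 CTC CTG
Pro 137 CCT CCG
Cys 138 TGT TGC
Arg 144 AGA CGA
Arg 149 AGA CGG
Phe 151 TTT TTC
Ser 154 TCA TCT
Thr 155 ACA ACC
Ser 160 AGT TCT
Arg 162 AGA CGA
Ser 163 AGT AGC
"""

def parse_exchanges(text):
    """Turn the whitespace-separated listing into (aa, position, natural, synthetic) rows."""
    rows = []
    for line in text.strip().splitlines():
        aa, pos, natural, synthetic = line.split()
        rows.append((aa, int(pos), natural, synthetic))
    return rows

def non_synonymous(rows):
    """Return the rows whose two codons do not both encode the stated amino acid."""
    return [r for r in rows
            if not (GENETIC_CODE[r[2]] == GENETIC_CODE[r[3]] == THREE_TO_ONE[r[0]])]

rows = parse_exchanges(EXCHANGES)
print(len(rows), "exchanges,", len(non_synonymous(rows)), "non-synonymous")
```

An empty result from non_synonymous indicates that every exchange is silent at the protein level.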

The structural gene is chemically synthesized. In brief, overlapping complementary
oligonucleotides about 30 to 50 nucleotides in length are synthesized in a way to cover both
strands of the structural gene sequence without any gaps. The oligonucleotides are
hybridized to each other and ligated using T4 DNA Ligase. The reaction product is cut with
restriction endonucleases and cloned into the pUC18 vector. The resulting plasmid is
sequenced and shows the correct sequence.
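
The principle of covering both strands with overlapping oligonucleotides can be sketched as follows. This is a simplified illustration in Python under stated assumptions, not the design procedure actually used: the oligonucleotide length is fixed, the bottom-strand oligonucleotides are staggered by half a length so that each junction on one strand is bridged by an oligonucleotide on the other, and the end pieces may come out shorter than the 30 to 50 nucleotides mentioned above. The function names are hypothetical.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def tile_oligos(seq: str, size: int = 40):
    """Split seq into gap-free top-strand oligos and staggered bottom-strand oligos.

    Bottom-strand pieces are cut half a length out of register with the
    top-strand pieces, so every nick in one strand faces an intact oligo.
    """
    top = [seq[i:i + size] for i in range(0, len(seq), size)]
    cuts = [0, *range(size // 2, len(seq), size), len(seq)]
    bottom = [reverse_complement(seq[a:b]) for a, b in zip(cuts, cuts[1:])]
    return top, bottom

# Example input: the 81 nt gac signal sequence coding region of this document.
example = ("ATGCTGAGAGTTCTGCACCGGGCGGCGTCCGCCTTGGTTATGGCG"
           "ACTGTGATCGGCCTTGCGCCCGCGGTCGCCTTTGCG")
top, bottom = tile_oligos(example, size=30)
print([len(o) for o in top], [len(o) for o in bottom])
```

A practical design would additionally balance melting temperatures and avoid very short terminal pieces; the sketch only demonstrates the gap-free, staggered coverage of both strands.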

The synthetic gene on this plasmid does not contain the gac signal sequence. This part of
the coding region is introduced via the gac fragment containing promoter, RBS and signal
sequence and fused to the rhIFNα2b structural gene.

The gac fragment is generated by chemical synthesis. For example, overlapping
complementary oligonucleotides about 30 to 50 nucleotides in length are synthesized in a
way to cover the full length of both strands of the gac fragment (including the restriction
endonuclease recognition sites on both sides plus a minimum of 6 additional basepairs to
allow efficient cleavage) without any gaps. The oligonucleotides are then hybridized to each
other (e.g. by heating and subsequent cooling) and ligated using T4 DNA Ligase. The
reaction product is then cut with the respective restriction endonucleases (Xba I and EcoR I)
and cloned into the pMG402 vector (see below).

In the alternative, the gac fragment containing promoter, RBS and signal sequence can be
amplified from a plasmid comprising such elements, like plasmid pKS55, the construction of
which is described in CS patent No. 278,515. The gac gene cloned therein has been derived from
a strain of Pseudomonas diminuta (CCM 3987). Amplification is carried out using a high
fidelity PCR system. The restriction endonuclease sites needed for cloning are introduced
via the following PCR primers.

Primers:

1. 5'-Phosphate - GGGGGGTCTAGACCAACAACATCTTCAACGTCTACC -3' 
(SEQ ID NO 11) 

2. 5'-Phosphate - CC CCC CGA ATT CAC TAG TAG GCG TCT CTC TCC -3' 
(SEQ ID NO 12) 

There will be no difference in performance between a gac fragment generated via high
fidelity PCR amplification and a gac fragment generated by chemical synthesis.

The thus created gac fragment has the following nucleotide sequence (SEQ ID NO 13): 

5'-GGGGGG TCTAGA CCAACAACATCTTCAACGTCTACCCGACCAAGATTCAGGAGCCGTCGG

CCGACCTGGGCAATGGGATGTACAGCGGGCTTGCGCCGTTCGGCTTCACCGGCGGATCCT 

GGTTCGTACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCGCTGCA 

ACGGCGTCTCCGGATCTGGGTGAGAGGGGAAATCCATGCTGAGAGTTCTGCACCGGGCGG 

CGTCCGCCTTGGTTATGGCGACTGTGATCGGCCTTGCGCCCGCGGAGAGAGACGCGTACT 

AGT GAATTC GGGGGG-3'

The gac fragment (either synthetic or created via PCR) and the vector plasmid pMG402 are
ligated using the Xba I and EcoR I sites. This way the expression vector pMG412 is
generated.

The expression vector, pMG412, contains codons 1 - 23 + the first nucleotide of codon 24 of
the gac signal sequence. Into codons 22-24 the Sac II site is introduced by silent mutation.
Anything downstream of the Sac II site in pMG412 is primer or vector sequence.

The last two nt of codon 24 + codons 25-27 are introduced by the forward PCR primer for
the target structural gene (rhIFNα2B, see above). Such a primer therefore contains the
following elements:

Cutting overhang (e.g. 6 nucleotides) - Sac II site - tc gcc ttt gcg (SEQ ID NO 14) -
hybridizing region corresponding to the 5' end of the "mature" target gene.

In particular, a suitable primer has the following nucleotide sequence (SEQ ID NO 15):
TT GCG CCC GCG GTC GCC TTT GCG - hybridizing region (Sac II site CCGCGG underlined)
The last amino acids (24-27) of the gac signal sequence are V A F A (SEQ ID NO 16).
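
The statements about the signal sequence can be verified with a short script. The following sketch in Python is illustrative only: it assembles the signal sequence coding region from the 3' part of the gac fragment (SEQ ID NO 13, up to and including the Sac II site) and the primer tail of SEQ ID NO 15, translates it with the standard genetic code, and locates the Sac II site. The variable and function names are assumptions of this sketch.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
GENETIC_CODE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Codons 1-27 of the gac signal sequence, written codon by codon.
SIGNAL_CODONS = (
    "ATG CTG AGA GTT CTG CAC CGG GCG GCG TCC GCC TTG GTT ATG GCG "
    "ACT GTG ATC GGC CTT GCG CCC GCG GTC GCC TTT GCG"
).split()

SAC_II = "CCGCGG"

def translate(codons):
    """Translate a list of codons into a one-letter amino acid string."""
    return "".join(GENETIC_CODE[c] for c in codons)

signal_cds = "".join(SIGNAL_CODONS)
signal_peptide = translate(SIGNAL_CODONS)
site_start = signal_cds.find(SAC_II)      # 0-based index of the Sac II site
site_end = site_start + len(SAC_II)       # number of nt up to the end of the site

print(signal_peptide, len(signal_peptide), site_start, site_end)
```

The end of the Sac II site coincides with codons 1 to 23 plus the first nucleotide of codon 24 (23 x 3 + 1 = 70 nucleotides), which is the part of the signal sequence stated to be present in pMG412.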



From the plasmid construct described above the rhIFNα2b gene is amplified using a high
fidelity PCR system. The 5' PCR primer contains the Sac II site for fusing the gene with the
gac fragment plus the last four codons of the gac signal sequence. The 3' primer contains
the TAA (ochre) stop codon and the Mlu I site for cloning. The amplification of the interferon
alpha structural gene generates the following fragment (SEQ ID NO 17):



GGGGGGCCGCGGTCGCCTTTGCGTGCGATCTGCCGCAAACCCACAGCCTGGGTAGCCGGC 
GAACCTTGATGCTTCTGGCACAGATGCGGCGAATCTCTCTTTTCTCTTGCTTAAAGGATC 
GACATGACTTCGGTTTCCCGCAGGAGGAGTTCGGTAACCAGTTCCAAAAGGCTGAAACCA 
TCCCGGTATTGCATGAGATGATCCAGCAGATCTTCAACCTGTTCAGCACTAAGGACTCTT 
CTGCTGCTTGGGATGAGACCCTGCTTGACAAATTCTACACTGAACTGTACCAGCAGCTGA 
ACGACCTGGAAGCCTGCGTGATCCAGGGTGTGGGTGTGACTGAGACTCCGCTGATGAAGG 
AGGACTCTATTCTGGCTGTGCGAAAATACTTCCAACGGATCACTCTGTATCTGAAAGAGA 
AGAAATACAGCCCGTGCGCCTGGGAGGTTGTCCGAGCAGAAATCATGCGGTCTTTCTCTT 
TGTCTACCAACTTGCAAGAATCTTTACGAAGCAAGGAATAA TACGCGT GAATTCGGGGGG 

This rhIFNα2b PCR fragment and pMG412 are ligated using the Sac II and Mlu I sites. This
way the final production/expression plasmid pMG414 is generated. Both strands of
pMG414 are sequenced and show no differences to the expected sequence.
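
As a plausibility check, the PCR fragment of SEQ ID NO 17 can be translated in silico. The sketch below (Python, illustrative only) copies the fragment as printed above, starts translation directly after the last three codons of the gac signal sequence (GCC TTT GCG) and reads until the stop codon. The helper name translate_orf and the variable names are assumptions of this sketch.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
GENETIC_CODE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# SEQ ID NO 17 as printed in the text; blanks are removed below.
FRAGMENT = (
    "GGGGGGCCGCGGTCGCCTTTGCGTGCGATCTGCCGCAAACCCACAGCCTGGGTAGCCGGC"
    "GAACCTTGATGCTTCTGGCACAGATGCGGCGAATCTCTCTTTTCTCTTGCTTAAAGGATC"
    "GACATGACTTCGGTTTCCCGCAGGAGGAGTTCGGTAACCAGTTCCAAAAGGCTGAAACCA"
    "TCCCGGTATTGCATGAGATGATCCAGCAGATCTTCAACCTGTTCAGCACTAAGGACTCTT"
    "CTGCTGCTTGGGATGAGACCCTGCTTGACAAATTCTACACTGAACTGTACCAGCAGCTGA"
    "ACGACCTGGAAGCCTGCGTGATCCAGGGTGTGGGTGTGACTGAGACTCCGCTGATGAAGG"
    "AGGACTCTATTCTGGCTGTGCGAAAATACTTCCAACGGATCACTCTGTATCTGAAAGAGA"
    "AGAAATACAGCCCGTGCGCCTGGGAGGTTGTCCGAGCAGAAATCATGCGGTCTTTCTCTT"
    "TGTCTACCAACTTGCAAGAATCTTTACGAAGCAAGGAATAA TACGCGT GAATTCGGGGGG"
).replace(" ", "")

SIGNAL_TAIL = "GCCTTTGCG"  # codons 25-27 of the gac signal sequence (A F A)

def translate_orf(dna, start):
    """Translate from `start` to the first stop codon.

    Returns the protein and the index of the first nucleotide after the stop.
    """
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = GENETIC_CODE[dna[i:i + 3]]
        if aa == "*":
            return "".join(protein), i + 3
        protein.append(aa)
    raise ValueError("no stop codon found")

mature_start = FRAGMENT.index(SIGNAL_TAIL) + len(SIGNAL_TAIL)
mature, orf_end = translate_orf(FRAGMENT, mature_start)
print(len(mature), mature[:10], mature[-5:])
```

The translated product should have the 165 amino acids of mature interferon alpha 2b, from cysteine 1 to glutamic acid 165, and the stop codon should be followed by the Mlu I site.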

Features of plasmid pMG414 (total size 3668 bps): 

bps 2728-256: pUC19 backbone, part 1
bps 257-546: gac fragment (promoter, RBS, signal sequence)
bps 547-1044: synthetic rhIFNα2b gene (including TAA stop)
bps 1045-1454: cloning sites + pUC19 backbone, part 2
bps 1455-2727: tet gene from pBR322 (promoter/RBS 1455-1536,
coding sequence including TAA stop 1537-2727)
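
The feature coordinates can be checked for consistency with the stated plasmid size. The following Python sketch is illustrative only: it treats the plasmid as circular, so that a feature running from bp 2728 to bp 256 wraps around the origin, and verifies that the features are contiguous and together account for all 3668 bps. The function names are hypothetical.

```python
PLASMID_SIZE = 3668  # total size of pMG414 as stated in the feature list

# (feature, first bp, last bp), 1-based inclusive coordinates from the list above.
FEATURES = [
    ("pUC19 backbone, part 1", 2728, 256),
    ("gac fragment", 257, 546),
    ("synthetic rhIFNa2b gene incl. stop", 547, 1044),
    ("cloning sites + pUC19 backbone, part 2", 1045, 1454),
    ("tet gene from pBR322", 1455, 2727),
]

def feature_length(start, end, size=PLASMID_SIZE):
    """Length of a feature on a circular molecule; wraps if end < start."""
    return end - start + 1 if end >= start else size - start + 1 + end

def is_contiguous(features, size=PLASMID_SIZE):
    """True if each feature starts right after the previous one ends (circularly)."""
    following = features[1:] + features[:1]
    return all(end % size + 1 == nxt_start
               for (_, _, end), (_, nxt_start, _) in zip(features, following))

lengths = {name: feature_length(s, e) for name, s, e in FEATURES}
print(lengths, sum(lengths.values()), is_contiguous(FEATURES))
```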




WO 2005/017162 



PCT/EP2004/009067




Thereby, the gac fragment containing promoter, RBS and signal sequence is fused to the
rhIFNα2b structural gene using a restriction endonuclease site at the 3' end of the gac
fragment introduced by a PCR primer. The same site is fused to the 5' end of the rhIFNα2b
structural gene, also by way of a PCR primer. So after cloning both elements (gac
fragment and rhIFNα2b structural gene) into the basic vector, a gene encoding a
gac1ss-rhIFNα2b fusion protein is generated. Of its total 192 codons (576 nucleotides) the first 27
encode the gac signal sequence not present in the final protein and codons 28 to 192
encode mature rhIFNα2b (165 amino acids, cysteine 1 to glutamic acid 165).



The nucleotide sequence of the expression cassette used in the rhIFNα2b expression
plasmid pMG414 (807 bps) (see below) and amino acid sequence of the gac1ss-rhIFNα2b
fusion protein is shown as follows (SEQ ID NO 18):



5'- TCTAGA CCAACAACATCTTCAACGTCTACCCGACCAAGATTCAGGAGCCGTCGGCCGACC
TGGGCAATGGGATGTACAGCGGGCTTGCGCCGTTCGGCTTCACCGGCGGATCCTGGTTCG
TACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCGCTGCAACGGCG
TCTCCGGATCTGGGTGAGAGGGGAAATCC

ATG CTG AGA GTT CTG CAC CGG GCG GCG TCC GCC TTG GTT ATG GCG
M   L   R   V   L   H   R   A   A   S   A   L   V   M   A
ACT GTG ATC GGC CTT GCG CCC GCG GTC GCC TTT GCG
T   V   I   G   L   A   P   A   V   A   F   A

TGC GAT CTG CCG CAA ACC CAC AGC CTG GGT AGC CGG CGA ACC TTG
C   D   L   P   Q   T   H   S   L   G   S   R   R   T   L
ATG CTT CTG GCA CAG ATG CGG CGA ATC TCT CTT TTC TCT TGC TTA
M   L   L   A   Q   M   R   R   I   S   L   F   S   C   L
AAG GAT CGA CAT GAC TTC GGT TTC CCG CAG GAG GAG TTC GGT AAC
K   D   R   H   D   F   G   F   P   Q   E   E   F   G   N
CAG TTC CAA AAG GCT GAA ACC ATC CCG GTA TTG CAT GAG ATG ATC
Q   F   Q   K   A   E   T   I   P   V   L   H   E   M   I
CAG CAG ATC TTC AAC CTG TTC AGC ACT AAG GAC TCT TCT GCT GCT
Q   Q   I   F   N   L   F   S   T   K   D   S   S   A   A
TGG GAT GAG ACC CTG CTT GAC AAA TTC TAC ACT GAA CTG TAC CAG
W   D   E   T   L   L   D   K   F   Y   T   E   L   Y   Q
CAG CTG AAC GAC CTG GAA GCC TGC GTG ATC CAG GGT GTG GGT GTG
Q   L   N   D   L   E   A   C   V   I   Q   G   V   G   V
ACT GAG ACT CCG CTG ATG AAG GAG GAC TCT ATT CTG GCT GTG CGA
T   E   T   P   L   M   K   E   D   S   I   L   A   V   R
AAA TAC TTC CAA CGG ATC ACT CTG TAT CTG AAA GAG AAG AAA TAC
K   Y   F   Q   R   I   T   L   Y   L   K   E   K   K   Y
AGC CCG TGC GCC TGG GAG GTT GTC CGA GCA GAA ATC ATG CGG TCT
S   P   C   A   W   E   V   V   R   A   E   I   M   R   S
TTC TCT TTG TCT ACC AAC TTG CAA GAA TCT TTA CGA AGC AAG GAA TAA
F   S   L   S   T   N   L   Q   E   S   L   R   S   K   E

T ACGCGT ACTAGT GAATTC -3'



The sequence as shown is divided into sub-paragraphs / regions which comprise:

1. the gac promoter and RBS (first paragraph, bps 257 to 465 of pMG414, see below),

2. the gac signal sequence coding region (second paragraph, bps 466 to 546 of pMG414,
see below),

3. the synthetic gene for rhIFNα2b (third paragraph, bps 547 to 1044 of pMG414 (see
below) - including the TAA stop codon), and

4. the 3' cloning linker (fourth paragraph, bps 1045 to 1063 of pMG414, see below).

On the pMG414 these four regions are directly joined to one another. They are separated in 
the figure for reasons of lucidity only. 
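
The codon numbers given for the fusion protein follow directly from these coordinates. A minimal Python sketch (illustrative only; the dictionary keys are names chosen here) derives the lengths of the four regions, the size of the cassette and the number of codons of the signal sequence and of mature rhIFNα2b:

```python
# 1-based inclusive coordinates on pMG414, taken from the list above.
REGIONS = {
    "promoter_rbs": (257, 465),
    "signal_sequence": (466, 546),
    "gene_with_stop": (547, 1044),
    "cloning_linker": (1045, 1063),
}

STOP_CODON_NT = 3

region_nt = {name: end - start + 1 for name, (start, end) in REGIONS.items()}
cassette_nt = sum(region_nt.values())

signal_codons = region_nt["signal_sequence"] // 3
mature_codons = (region_nt["gene_with_stop"] - STOP_CODON_NT) // 3
fusion_codons = signal_codons + mature_codons

print(region_nt, cassette_nt, signal_codons, mature_codons, fusion_codons)
```

The result reproduces the 807 bps of the cassette, the 27 codons of the signal sequence, the 165 codons of the mature protein and the 192 codons (576 nucleotides) of the fusion protein stated in the text.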

The start (ATG) and the stop (TAA) codons of the open reading frame are shown in bold.
The first (TGC) and the last (GAA) codon of mature rhIFNα2b are underlined.



The restriction endonuclease sites used for cloning are boxed. These are:
- Xba I (TCTAGA) and EcoR I (GAATTC) for the introduction of the gac fragment
(promoter, RBS, signal sequence, Sac II, Mlu I, Spe I sites)
- Sac II (CCGCGG) and Mlu I (ACGCGT) for the introduction of the rhIFNα2b PCR
fragment (including four codons for the last four amino acids of the gac1ss, the 495 bp
synthetic gene for mature rhIFNα2b, and the TAA(T) stop codon).

The gac promoter shows high constitutive / basal activity; the addition of a chemical inducer
or a physical stimulus (change in culture conditions) is not required.

1.3 Cloning and establishment of the recombinant cell line 

The expression plasmid pMG414 is introduced into the host strain ATCC PTA-3132
(= W3110 (ATCC 27325)) by electroporation. Electrocompetent cells are prepared according
to a standard protocol; electroporation is carried out in 0.1 mm cuvettes at 1800 V using an
Eppendorf Electroporator 2510.

After electroporation the reaction is suspended in liquid medium and plated onto agar plates
containing tetracycline.

The starting point for selection of a suitable cell clone is a transformation plate thus obtained.
Various clones from this plate are grown in liquid culture and cryopreserved as research cell
banks. Their productivity is tested in shake flask experiments and compared. The best clone
(E1/116) is used for further development.

The best clone may show good productivity but relatively poor growth. This poor growth can
result from various factors, e.g. product toxicity to the host cell, metabolic burden due to
product synthesis etc. The addition of glucose often brings some improvement because
glucose downregulates (e.g. by catabolite repression) many promoters used for recombinant
protein expression. Also, glucose has a general positive effect on the growth of E. coli
because it can be directly introduced into the metabolism as a carbon source.

In the case of E1/116, a clear positive effect of glucose on growth is observed. The best
results are achieved with glucose concentrations between 2 and 5 g/L. To adapt the cell line
to cope with product formation and consequently to better growth in the absence of glucose,
the strain is therefore grown in liquid medium in shake flasks for several passages ("shake
flask cascade").

More specifically, a cryovial of E1/116 is thawed and the cell suspension streaked onto
glucose free LB agar plates containing tetracycline. The plate is incubated at 37 °C until the
colonies reach a sufficient size for inoculating a liquid culture. Colonies are transferred from
the plate into small shake flasks filled with 15 mL of glucose free LB broth containing
tetracycline. The cultures are shaken at 37 °C until they reach an optical density at 600 nm of
above 0.5 (typically > 1.0). For this first round this takes up to 48 hours due to the poor
growth characteristics of the original isolate.

The procedure described in the above paragraph is performed five consecutive times with
the liquid culture of the previous round being streaked onto plates and the colonies from the
plates serving to inoculate the next liquid culture. From each liquid culture the optical density is
determined and a sample is taken for determination of product titer using SDS-PAGE -
Western Blot. Clones from the culture with the best combination of growth and productivity
are then used to initiate the next round.

In the course of the different rounds of this culture cascade (i.e. multiple propagation and
reisolation steps) the growth characteristics of the (sub)strain(s) gradually improve. By
choosing the strain with the best combination of growth and productivity in each round, titers
are also gradually increased. After the fifth round again single colonies are generated on LB
agar plates containing tetracycline and used to inoculate a liquid culture containing
tetracycline for the generation of a primary seed lot (PSL). The culture is grown at 37 °C to
an optical density of about 1.5, mixed with an equal amount of sterile 40% w/v glycerol,
aliquoted into cryogenic vials and frozen at -60 °C. This PSL is used as a starting point for
the generation of the GMP cell banks (Master Cell Bank and Working Cell Bank) of the
interferon alpha 2b production strain. This reisolate is designated E1/116a. Reisolation
processes like the one described above have proved to yield reproducible results.

E1/116a shows excellent growth characteristics in shake flasks and stirred bioreactors
(fermenters). An inoculum suitable for starting a bioreactor can be grown in a shake flask
starting from a Working Cell Bank vial in about 8 hours.

A Master Cell Bank is prepared under cGMP conditions from the primary seed lot described
above. In brief, a PSL vial is thawed and plated onto tetracycline containing agar plates. A
single colony is picked and used to inoculate the Master Cell Bank (MCB) shake flask culture
(LB broth medium containing tetracycline). The cell suspension from the logarithmic growth
phase is mixed 1+1 with 40% w/v glycerol, aliquoted at 1.8 mL into cryogenic vials, sealed
in cryogenic tubing, and frozen in the liquid phase of a liquid nitrogen tank.

The Working Cell Bank is generated in the same way as the Master Cell Bank except that 
the shake flask culture is inoculated with cell suspension from a thawed MCB vial. 



Example 2: Fermentation process for production of recombinant human interferon alpha 2B
(rhIFNα2B)

The fermentation process is started by growing the strain E. coli K-12 W3110 obtainable from
the Working Cell Bank as described above in shake flask cultures in Luria Bertani (LB)
medium at 37 °C with the addition of the antibiotic tetracycline hydrochloride to avoid growth of
non-plasmid carrying cells.

The shake flask culture is then used to inoculate the seed culture (= pre-culture) medium
(inoculum size = 0.4%). The medium for this pre-culture cultivation is based on deionized
water containing glucose as a sole carbon source and yeast autolysate as a complex
nitrogen source. In addition, inorganic salts like KH2PO4, K2HPO4, (NH4)2SO4 and
MgSO4·7H2O are added to the medium. As an antifoam agent polypropylene glycol 2000
(PPG2000) is used. In particular, the pre-culture medium has the following composition:



Pre-culture Basal Medium:

Component                                       Amount

De-ionized water (WBI)                          30 l
Yeast autolysate, KAT, Ohly                     21.7 g/l
Glucose Monohydrate, pure                       25.0 g/l
Ammonium Sulfate, p.a.                          1.0 g/l
Potassium Phosphate Monobasic, p.a.             1.5 g/l
Potassium Phosphate Dibasic, anhydrous, pure    3.0 g/l
Magnesium Sulfate Heptahydrate, p.a.            0.5 g/l
Polypropylene Glycol 2000                       0.5 ml/l



These media components are sterilized together for 20 minutes at 121 °C. After cooling of the
basal medium, an aliquot of a 5 g/L sterile stock solution of the antibiotic tetracycline
hydrochloride is added to the basal medium (the stock solution is sterilized by filtration
through a 0.22 µm filter).
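
Since the recipe is given per litre, the amounts for a batch follow by simple scaling. The Python sketch below is an illustration under stated assumptions, not part of the original procedure: the batch volume is taken as the 30 l of de-ionized water listed in the recipe, volume changes caused by the additions are ignored, and the tetracycline target of 15 mg/L is assumed to be the same concentration as used for selection in Example 1. The function names are hypothetical.

```python
# Pre-culture basal medium, solid components in g per litre (from the recipe above).
PRE_CULTURE_G_PER_L = {
    "yeast autolysate": 21.7,
    "glucose monohydrate": 25.0,
    "ammonium sulfate": 1.0,
    "potassium phosphate monobasic": 1.5,
    "potassium phosphate dibasic": 3.0,
    "magnesium sulfate heptahydrate": 0.5,
}

def batch_amounts(recipe_g_per_l, volume_l):
    """Scale a per-litre recipe to grams per batch."""
    return {name: conc * volume_l for name, conc in recipe_g_per_l.items()}

def stock_volume_ml(target_mg_per_l, stock_g_per_l, volume_l):
    """Millilitres of stock needed; a stock of x g/L contains x mg per mL."""
    return target_mg_per_l * volume_l / stock_g_per_l

amounts_g = batch_amounts(PRE_CULTURE_G_PER_L, volume_l=30)
tet_ml = stock_volume_ml(target_mg_per_l=15, stock_g_per_l=5, volume_l=30)
print(amounts_g, tet_ml)
```

Under these assumptions, 30 l of medium would receive 90 mL of the 5 g/L tetracycline stock solution.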



Stock-solution Tetracycline Hydrochloride (5 g/L):

Component                                       Amount

Tetracycline Hydrochloride cryst., Ph.Eur.      15 mg/l
De-ionized water (WBI)





The cultivation time for the seed culture is about 16 hours. During cultivation of the seed culture
the pH value is controlled to a set-point of 7.0 ± 0.2 with sulfuric acid and sodium hydroxide or
concentrated ammonia solution. The concentration of dissolved oxygen is kept at levels higher
than 20% of saturation by increasing the stirrer speed. Stirrer speed at the beginning of the
cultivation is set to 300 rpm, back-pressure in the vessel to 0.3 bar and the aeration rate is
controlled to 30 L/min (equivalent to "1 vvm"). Temperature is kept constantly at 37 °C during
cultivation. As a transfer criterion of broth to the main stage of the fermentation process, an
increase of the dissolved oxygen concentration after consumption of the carbon source is
used.

For main culture cultivation a medium based on deionized water, glucose as a carbon source
and yeast autolysate as a complex nitrogen source is used. Besides the addition of the
inorganic salts (NH4)2SO4, CaCl2·2H2O and MgSO4·7H2O, PPG 2000 is used as an antifoam
agent. The initial glucose is sterilized separately and added to the sterile rest of the medium.
Inoculum size to the main fermenter medium is in a range between 0.75 and 3%. In
particular, the main culture medium has the following composition:



Main Culture Basal Medium:

Component                                       Amount

De-ionized water (WBI)                          60 l
Yeast autolysate, KAT, Ohly                     43.5 g/l
Ammonium Sulfate, p.a.                          1.0 g/l
Calcium Chloride Dihydrate cryst., p.a.         0.3 g/l
Magnesium Sulfate Heptahydrate, p.a.            1.0 g/l
Polypropylene Glycol 2000                       0.5 ml/l



These media components are sterilized together for 20 minutes at 121 °C. After
cooling, an aliquot of a separately heat-sterilized 800 g/L glucose stock solution is added to the
main culture basal medium (the glucose solution is sterilized for more than 30 minutes at 120 °C).



Glucose Stock-solution, 800 g/l:

Component                                       Amount

Glucose Syrup                                   12.5 ml/l
De-ionized water (WBI)
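
The initial glucose concentration of the main culture can be estimated from these figures. The short Python sketch below is an interpretation, not a statement of the original text: it assumes that "12.5 ml/l" denotes millilitres of the 800 g/L glucose stock solution added per litre of main culture medium and it neglects the small volume increase caused by the addition.

```python
def initial_glucose_g_per_l(stock_g_per_l, stock_ml_per_l_medium):
    """Glucose concentration in the medium after adding the stock (dilution neglected)."""
    return stock_g_per_l * stock_ml_per_l_medium / 1000.0

glucose_start = initial_glucose_g_per_l(stock_g_per_l=800, stock_ml_per_l_medium=12.5)
print(glucose_start)
```

Under this reading the main culture starts with about 10 g/L glucose, which must be consumed completely before glucose feeding begins.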





The most important point during this cultivation is the necessity of a complete consumption
of the initial glucose present in the medium. This leads to a sharp increase of dissolved
oxygen concentration after about 9 hours of growth. If glucose feeding is started before the
initial glucose is completely consumed, no product formation is observed. Glucose limitation
controlled by the feeding of the glucose solution at a constant rate is therefore very
important. The temperature during cultivation is controlled to a constant value of about 28 °C.
The initial stirrer speed is set to 300 rpm, the aeration rate is controlled to 100 L/min
(equivalent to "1 vvm") and the back-pressure in the vessel is set to 0.3 bar. The pH value is
controlled to 7.1 ± 0.3 with sulfuric acid and sodium hydroxide or concentrated ammonia
solution. A peak of the pH value up to 8.0 after consumption of the initially supplied glucose
is acceptable.
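
The use of the dissolved oxygen (DO) signal as a trigger for glucose feeding can be illustrated as follows. This Python sketch is purely hypothetical: the original text gives no numerical criterion for the "sharp increase", so the window of three samples and the jump of 15 percentage points are assumptions chosen for illustration, as is the function name.

```python
def feed_start_index(do_percent, window=3, jump=15.0):
    """Index of the first DO sample indicating glucose exhaustion, or None.

    A sample qualifies if it exceeds the minimum of the preceding `window`
    samples by at least `jump` percentage points of saturation.
    Both thresholds are illustrative assumptions, not values from the source.
    """
    for i in range(window, len(do_percent)):
        if do_percent[i] - min(do_percent[i - window:i]) >= jump:
            return i
    return None

# Hypothetical DO trace (% of saturation): falling during growth, then a sharp rise.
trace = [80, 60, 45, 35, 30, 28, 27, 55, 70]
print(feed_start_index(trace))
```

In such a scheme the constant-rate glucose feed would be started only after the trigger has fired, i.e. after the initially supplied glucose has been consumed.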

The concentration of dissolved oxygen (DO) is kept at levels higher than 20% of
saturation, preferably between about 40% and 100% of saturation. Depending on the oxygen
transfer capacity of the bioreactor, this is achieved by first increasing the stirrer speed to a
maximum value. If this is not sufficient, first the aeration rate and after that the back-pressure
is increased. After a cultivation time between 48 and 192 hours (a linear increase of product
formation is observed with cultivation time) the culture is harvested, cooled to 15 ± 5 °C and
conditioned for downstream processing by the addition of sucrose/EDTA to the cooled broth.

The results of a fermentation batch are analysed by the Western blot technique or by
HPLC measurements after laboratory or pilot plant periplasmic extraction of the product.

Example 3: Cell disruption and extraction

A fermentation broth obtained as described above containing host cells comprising the
expressed interferon alpha 2B in the periplasmic space is adjusted with sulfuric acid to a pH of
5.0 ± 0.1 immediately after the fermentation and cooled down to 4 °C ± 2 °C. The low pH and
the low temperature help to inactivate endogenous proteases.

The fermentation broth is adjusted to 10 °C to 20 °C; then, without any concentration or
washing of the cells, solid or liquid sucrose (200 g sucrose/kg fermentation broth) and EDTA
(concentration 10 mM) are added and the pH is adjusted to 8.0. After a selective one-step cell
permeation protocol using osmotic shock (1+3 dilution) with cooled water, whereby the
fermentation broth comprising sucrose and EDTA is poured or pumped into the cooled
(temperature about 4 °C) water, the released periplasmic extract is clarified.
Polyethyleneimine is added to a final concentration of 0.05% and the pH is adjusted to
about 7.5 with acetic acid. After 15 to 45 minutes cell debris and DNA flocculate, leaving a
clear crude extract containing interferon which may be subjected to centrifugation to improve
clarity.
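
The quantities involved in this step scale with the amount of broth. The following Python sketch illustrates the arithmetic under explicit assumptions that are not stated in the original text: the "1+3" dilution is read as one part of conditioned broth added to three parts of cooled water by mass, the polyethyleneimine concentration of 0.05% is read as weight per weight of the diluted extract, and the mass of EDTA is neglected. The function name is hypothetical.

```python
def extraction_additions(broth_kg, sucrose_g_per_kg=200.0, water_parts=3.0,
                         pei_percent=0.05):
    """Estimate sucrose, dilution water and polyethyleneimine for a given broth mass.

    Assumptions (not from the source): mass-based 1+3 dilution, w/w PEI
    concentration, EDTA mass neglected.
    """
    sucrose_kg = broth_kg * sucrose_g_per_kg / 1000.0
    conditioned_kg = broth_kg + sucrose_kg
    water_kg = conditioned_kg * water_parts
    extract_kg = conditioned_kg + water_kg
    pei_kg = extract_kg * pei_percent / 100.0
    return {"sucrose_kg": sucrose_kg, "water_kg": water_kg,
            "extract_kg": extract_kg, "pei_kg": pei_kg}

additions = extraction_additions(broth_kg=100.0)
print(additions)
```

For 100 kg of broth this reading gives 20 kg of sucrose, 360 kg of cooled water and roughly 0.24 kg of polyethyleneimine.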

This procedure leads to a clear periplasmic extract comprising the desired interferon alpha
2B in high yield with a purity of > 20% with respect to the total protein content.
Polyethyleneimine helps to separate the cell debris from the soluble protein extract leading
to a very pure interferon solution.

Example 4: Chromatographic purification of recombinant human interferon α 2B (rhIFNα2B)

4.1 Capture by Cation Exchange Chromatography (CEX) 

After pH adjustment to 4.8 - 5.2 with acetic acid and a filtration step using a 0.3 micron filter,
the crude extract of Example 3 is applied to the CEX column (S ceramic HyperD F
(Biosepra)). After a washing step with an equilibration buffer (20 mM sodium acetate and
70 mM NaCl at pH 5.0) the interferon is eluted with a step gradient at 175 mM NaCl. The
fraction collected is immediately processed by the process step of Example 4.2.

4.2 Anion Exchange (AEX) Chromatography 

The fraction from Example 4.1 is adjusted to a pH of 7.3 to 7.7 with sodium hydroxide,
diluted with purified water to a conductivity of 3.5 to 4.5 mS/cm and applied to the AEX
column (Q ceramic HyperD F (Biosepra)). After washing, the interferon is eluted with a linear
salt gradient (0 - 300 mM NaCl) at about 150 mM NaCl. Fractions are collected that have a
purity of greater than or equal to 90 area % according to IPC reversed-phase HPLC and
used directly in the next step (see Example 4.3).

4.3 Hydrophobic Interaction Chromatography (HIC)

The fraction of Example 4.2 is diluted (1:1) with a stock solution of sodium sulphate (0.5%
sodium sulphate), adjusted to pH 7.3 to 7.7 with NaOH or HCl and applied to the HIC column
(Source 15PHE (Pharmacia)). After washing, the interferon fraction of Example 3 is eluted
with a linear sodium sulphate gradient (800 - 0 mM sodium sulphate) at about 400 mM
sodium sulphate. The fractions collected that have a purity of greater than or equal to 93
area % and no impurity greater than or equal to 3% according to IPC reversed-phase HPLC
are used directly in the next purification step.

4.4 Cation Exchange Chromatography (CEX) 

The collected fractions of Example 4 are diluted with water to a final conductivity of 7.5 to 8.5
mS/cm, adjusted to pH 4.3 to 4.7 with 99 to 100% acetic acid and applied to the CEX column
(Toyopearl SP-650 S (TosoHaas)). After a washing step, the interferon is eluted in a linear
NaCl gradient (0 - 300 mM NaCl) at about 250 mM NaCl. The fractions are collected that
have a purity of greater than or equal to 95 area % and no impurity greater than or equal to
3% according to IPC reversed-phase HPLC and are used directly in the next purification
step.

4.5 Size Exclusion Chromatography 

The last purification step is a gel filtration step to remove dimers and other aggregates and 
to perform a buffer exchange for the final formulation. The Superdex 75 pg used in this step 
shows a good resolution even at a high load volume (5 % - 15 %). The SEC is performed in 
25 mM sodium phosphate and 130 mM NaCl + 0.3 mM EDTA at a pH of about 7.3 to 7.7.

The fractions with the highest purity (> 95% main peak in RP-HPLC and no side peak > 3%)
are pooled to give the final bulk solution comprising the desired recombinant human
interferon α 2B in pure form in a high yield.
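
The pooling criteria of the intermediate chromatography steps 4.2 to 4.4 can be summarized in a small decision function. The Python sketch below is an illustration only; the dictionary keys and the function name are chosen for this example, and the inputs are assumed to be the main peak area % and the largest single impurity (in %) from IPC reversed-phase HPLC.

```python
# step: (minimum main peak purity in area %, impurity limit in % or None)
# A fraction fails if any single impurity is greater than or equal to the limit.
CRITERIA = {
    "4.2 AEX": (90.0, None),
    "4.3 HIC": (93.0, 3.0),
    "4.4 CEX": (95.0, 3.0),
}

def poolable(step, main_peak_area_pct, largest_impurity_pct):
    """True if a fraction meets the purity criteria stated for the given step."""
    min_purity, impurity_limit = CRITERIA[step]
    if main_peak_area_pct < min_purity:
        return False
    if impurity_limit is not None and largest_impurity_pct >= impurity_limit:
        return False
    return True

print(poolable("4.3 HIC", 94.0, 2.5), poolable("4.4 CEX", 94.9, 1.0))
```

The final size exclusion step uses strict limits (> 95% main peak, no side peak > 3%) and is therefore not covered by this table.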



